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Abstract 

The definition of time is still an open question when one deals with 
high frequency time series. If time is simply the calendar time, prices 
can be modeled as continuous random processes and values resulting 
from transactions or given quotes are discrete samples of this underly- 
ing dynamics. On the contrary, if one takes the business time point of 
view, price dynamics is a discrete random process, and time is simply 
the ordering according which prices are quoted in the market. In this 
paper we suggest that the business time approach is perhaps a better 
way of modeling price dynamics than calendar time. This conclusion 
comes out from testing probability densities and conditional variances 
predicted by the two models against the experimental ones. The data 
set we use contains the DEM/USD exchange quotes provided to us 
by Olsen & Associates during a period of one year from January to 
December 1998. In this period 1,620,843 quotes entries in the EFX 
system were recorded. 
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1 Introduction 



In the high-frequency arena there are two main-streams about modeling the 
stochastic properties of quotes. The first approach is to consider quotations 
as sampled values of an underlying continuous-time random process [S] , [7] . 
Sampling is itself a random operation, thus introducing a twofold uncertainty 
in the price determination jS], In this framework, time in the model 

flows continuously, and is called calendar time. 

In the second approach, quoted prices are modeled through a discrete-time 
stochastic process |12| : in this setting, time is just the natural total order 
relation among quotations, and it is iso-morphic with the set of non-negative 
integers (being time 0, the time associated to the first considered quotation). 
This is the business time approach, and randomness only enters in the 
determination of prices. It should be pointed out, however, that the waiting 
times between two quotes are also random quantities, but they are assumed 
to not contribute to the price determination process. 

Whether a calendar-time or a business-time framework should be adopted 
in modeling the stochastic nature of financial quotes, has been a longly de- 
bated issue by the finance research community, and it clearly depends on 
many factors, like, for example: a) adherence to the physical behavior of 
reported prices, b) usefulness in terms of a theory to be developed, and c) 
last but not least, a matter of taste. See, for example PQ, [S], 

In this paper, we suggest that business time is perhaps a better tool for 
modeling the asset dynamics than calendar-time. In order to support our 
claim, we consider: 1) returns corresponding to a given calendar time lag 
and any business time lag, 2) returns corresponding to the same calendar 
time lag but having a fixed business time lag. We find out that their statis- 
tical properties are different consistently with the business hypothesis and 
inconsistently with the calendar one. In practice, we estimate some vari- 
ances and some probability densities whose behavior is different in the two 
scenarios. 

The dataset we use contains the DEM/USD exchange quotes taken from 
Reuters' EFX pages (the dataset having been supplied by Olsen & Asso- 
ciates) during a period of one year from January to December 1998. In 
this period 1,620,843 quotes entries in the EFX system were recorded. The 
dataset provides a continuously updated sequence of bid and ask exchange 
quotation pairs from individual institutions whose names and locations are 



also recorded. The reason for using FX data is that this market is not sub- 
ject to any working time restriction; in fact, it is open 24 hours a day, seven 
days a week. This is in contrast to stock markets, where artificial time reg- 
ulation would have made more difficult, if not impossible, to find out the 
results outlined in this paper. 

2 Business time vs Calendar Time 
2.1 Calendar Time 

In the calendar time framework, prices are modeled as continuous-time ran- 
dom processes. Clearly, market quotes are not defined for every t £ K, but 
only at discrete intervals, whose extensions in time are called calendar lags 
(usually ranging from 2sec. to several minutes, sometimes hours). Never- 
theless, according to the calendar time picture, prices are usually considered 
as discrete samples of an underlying continuous-time random process. 

The model of price dynamics in calendar time therefore has the following 
structure: 



where S(t) and S(t + A) are the spot prices at times t and t + A, A is an 
arbitrary calendar time lag and i? a (t) is the aggregated return of prices over 
the time interval [t, t + A]. 

Considering a framework where prices evolve over the calendar time, it 
is generally assumed that quotes result from a random sampling at times 
to, . . . t n of the continuous-time underlying process S(t). In a pure calendar 
time framework such a random sampling is uncorrelated with the process 
S(t) itself. We observe however that this is only valid as an approximation; 
indeed, several studies have shown a weak correlation between the sequence 
of lags and that of returns, among which we cite |1U| . 

The last assumption usually made in order to complete the model de- 
scription in the calendar time setting is that the variance of -Ra (t) is a linear 
function of the calendar time lag A, i.e.: 



S(t + A) = S(t)e RA ® 




Var[R A (t)} 



= a 



2 A 




If the logarithm of S(t) has independent increments the above equation ob- 
viously holds and a is the constant volatility. However, it is well known that 
independence does not hold because of volatility clustering which is due to 



the correlation of the absolute values of returns jSj. As a consequence, in 
spite of a constant volatility, one has a time dependent volatility. Neverthe- 
less, the above behavior of the variance still holds true but a 2 is now the 
average of the squared volatility. For our purposes we only assume that the 
above equality holds and we do not need of specific assumptions concerning 
volatility behavior. 

Let us define the process M(t) as follows: 

M(t + A) = M(t)+M A (t) (3) 

where M A (t) represents the number of given quotes (samples) in the interval 
[t, t+A]. Clearly, M(t) is a non-decreasing random process assuming integer 
values. We also observe that M{t) as a function of t is piecewise constant, 
and its value increases by one each time a quote is given (i.e. at times 
to, . . . t n ). 

Given the assumptions made so far, it follows that the process M and the 
process S are mutually independent. Hence, it follows that the probability 
density of returns corresponding to a calendar time lag A is insensitive from 
the condition that M A {t) is also fixed to a value m. In symbols: 

P[R A (t)\M A (t) =m] = P[R A (t)] (4) 

and, in particular, the associated variance exhibits the same insensitiveness: 

Var[R A (t)\M A (t) = m] = Var[R A {t)\ =a 2 A (5) 
Therefore, we can summarize the calendar time hypothesis as follows: 

Hypothesis HI: The asset prices evolve over calendar time, i.e. according 
to the model in Eq. (^Q) and Eq. © holds. Moreover the processes S and M 
are mutually independent, therefore Eq. Q) and Eq. (J5J) hold. 

Let us anticipate that the main argument of the paper is based on the 
estimation of the quantities in Eq. (JIJ) and Eq. ©. We will show with 
enough evidence that the two equalities are largely violated in a way which, 
on the contrary, is consistent with the business framework. 

2.2 Business Time 

In the business-time approach, price dynamics is modeled as a discrete-time 
random process. Indeed, the time basis is the ordered sequence of times at 



which prices are quoted in the markets. It is therefore a set isomorphic with 
the set of non-negative integers. In such a framework the statistic model of 
price dynamics in the business-time framework is the following: 

S(n + m)=e Rm WS(n) (6) 

where S(n) and S(n + m) are the asset price at business times n and n + ra 
while R m (ri) is the aggregated return over m consecutive quotes. It is then 
clear that the only time-dependence affecting the price process is based on 
the global ordering of events while the return is independent from calendar 
lag. Notice that we refer to m as the business time lag as opposed to the 
calendar time lag A introduced in the previous section. 

Considering the price dynamics in a business time setting naturally leads 
to the following assumption: 

Var[R m (n)] = & 2 m (7) 

whose motivation is the same of that provided for the analogous assumption 
in the calendar time hypothesis. We also define the random process: 

T(n + m) = T(n) + T m (n) 

where T{n) is the stochastic calendar time at business time n and T m (n) 
corresponds to the calendar lag T(n + m) — T(n), i.e. the time elapsed from 
T(n) after the occurrence of m consecutive quotes. It can be readily seen 
that there is a direct connection between T(n) and the process M{t) defined 
in the previous subsection. In fact, M(t) = n with t G [T(n),T(n+ 1)), and, 
moreover, the following relation holds: 

M Tm(n) {T(n))=m 

for an arbitrary positive integer m. 

Given the assumption of statistical independence between S{n) and 
T(n), for a generic A the following relation holds: 

P[R m (n)\T m (n) E [A - e, A + e]] = P[R m (n)\ (8) 

where e is a fixed quantity. The above equation states that the probability 
density of returns corresponding to a business time lag m is insensitive to 
the condition that the calendar time lag is also fixed to a value around A. 
In particular we have for the variance: 

Var[R m {n)\T m {n) € [A - e, A + e]] = Var[R m (n)\ = a 2 m (9) 



which is the business time analogue of Eq. (J5J). 

Given all the assumptions made so far, we are ready to formulate the 
hypothesis of prices dynamics in a business time setting. 

Hypothesis H2: Asset prices follow the model in Eq. Q and Eq. 
holds. Moreover, the processes S and T are independent, it follows that 
Eq. © and Eq. @ hold. 

Before concluding this preliminary outline of the two basic approaches 
used to describe price dynamics (i.e. calendar time & business time) we also 
give another important property of some of the quantities involved so far, 
which will turn useful in the remaining part of the paper. 
With all the positions previously made, let us first observe that the following 
relation holds: 1 

E[M A (T(n))]=aA 

for a suitable constant a. Simply put, this property states that the expected 
value of the number of quotes in an interval A is proportional to A itself. 

Finally, considering the composition of the price process in business time 
and the process representing the number of quotes in a given calendar time 
lag A, it can be shown that: 

Var[R MA(tn) (n)] = a 2 E[M A (T(n))] = & 2 aA (10) 

Thus, in the business time hypothesis, we also expect the variance in (|1U|) 
to be proportional to A. 

As already anticipated, all equalities in this subsection are supported by 
the following statistical analysis confirming the validity the business time 
framework. 

3 Statistical Estimators 

In this and next section we carry out some experimental analysis in order to 

best fit the description of prices dynamics choosing between the two distinct 

possibilities concisely modeled by hypotheses HI and H2. 

1 This follows from the stationarity of the process M&{T(n)). In particular: 
£[Ma(T(«))] does not depend on T{n) so we drop the sub case. Moreover, E[MkA] = 
kE[M&\, since the average number of quotes in k intervals of the same length sums up to 
k times the value for the single interval, from which the proportionality follows. 



In this section, in particular, we will define some statistical estimators, 
i.e. functions of the data contained in high frequency time series, and relate 
them to their probabilistic counterparts defined in the previous section. 

Our dataset refers to the FX ratio USD/DM over the whole year 1998 
and the price Si we consider in this paper is the half sum of bid and ask 
(mid-price) while ti denote the time at which the i-th price is given. Some 
automatic filtering procedure is also applied, to remove erroneous recording, 
which we are able to individuate since they correspond to prices macroscop- 
ically different from previous and subsequent ones. 

Let 1Z = {ri} i=0 1 L be the series of elementary returns r, defined as: 

n = log-^l i = 0, 1,...,L 

and let T = {ri} i=0 x £ be the series of temporal lags defined as: t% = 

Now consider the series TZ(A, m) = {rj(A, m)} i=0 1 L ^ A m y, the rj(A, m) 
are obtained by summing m consecutive elementary returns (where m is 
fixed) and subsequently retaining only the L(A, m) sums corresponding to 
a lag in the interval [A — e, A + e] (i.e. the sum of the corresponding m 
elementary lags Tj is in the interval [A — e, A + e], where e is also a fixed 
quantity) . 

The mean and variance of such a series are respectively defined as: 

L(A,m) 

M (A,m) = T(K — j g n(A,m) 

I L(A,m) 

v(A,m) = — — r h(A,m) - /i(A,m)] 2 

-L{L\,m) 

We observe that v(A, m) represents an estimation of the quantity Var[R&{t)\M&{t) = 
m] for the calendar time model; and, as pointed out before, we expect it to 
be a linear function of A, should hypothesis HI be correct. Moreover, in 
this hypothesis, we expect this variance to be constant with respect to m if 
A is fixed. 

Alternatively, considering the business time framework, v(A, m) can also 
be seen an estimator of the quantity Var[R m (n)\T m (n) € [A — e, A + e]] 
defined in Eq. (J7J); should hypothesis H2 be correct we expect, given m, that 
v(A,m) is approximately constant with respect to A. Moreover, in this 
hypothesis, we expect this variance to be linear in m even if A is fixed. 



With the same set of data 1Z(A,m) we can can compute the empirical 
pdf of returns with fixed A and with fixed m. This pdf is an estimator of 
P[R A {t)\M A (t) = m] and also of P[R m {n)\T m {n) G [A — e, A + e]]. 

Consider now the series 1Z(A) = {^(A)} i=01 l(A) obtained from 1Z 
by summing consecutive elementary returns until the corresponding lag be- 
comes equal or greater than A. The number of the elements of this series is 
L(A) and the mean and variance are respectively defined as: 

1 MA) 

MA) = m E^) 

x ' i=l 

1 HA) 

v(A) = £[ ri (A)-/,(A)] 2 

In the calendar time framework, v(A) estimates the quantity Var[R A (t)], 
defined in Eq. In the business time case, instead, v(A) estimates the 
quantity Var[R MA ^(n)] in Eq. In both cases we expect this quantity 
to grow linearly with A. 

With the same set of data TZ(A) we can can compute the empirical pdf 
of returns with fixed A (any m). This pdf is an estimator of P[R A (t)]. 

4 The choice of the correct model from data anal- 
ysis 

We have now sufficient information in order to accept or discard hypothesis 
HI and H2, as a result of an empirical data analysis. 

First, we have computed the statistical estimators v (A) and v(A, m = 
40) as defined in the previous section and both plotted in Fig. ^ f° r differ- 
ent values of the calendar time lag A. It can be readily seen that while 
v(A) varies linearly with A, the quantity v(A,m) is approximately con- 
stant. Indeed, a linear fit was computed in the first case resulting in 
v = 6.16E — 10A + 8.26E — 8 and a constant fit in the second resulting 
in v = A.83E - 7. 

We recall that, according to the calendar time hypothesis the two lines 
should be equal and proportional to A, while in the business time case the 
former should be proportional to A, while the latter should be constant. 
The corresponding graphs in fig. ^ seem to suggest that the business time 
model is more likely valid, while the hypothesis of calendar time dynamics 
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Figure 1: We plot here the statistical estimators v(A) (+ symbols) and v(A,m) with 
m = 40 (x symbols) for different values of the calendar time lag A. It can be readily seen 
that while v(A) varies linearly with A, the quantity v(A,m) is approximately constant. 
Therefore, if the business time lag is fixed (at m = 40), the variance of the returns does 
not scale with time lag A. This would indicate that business time lag rather than calendar 
time lag forms the important independent variable. A linear fit was computed in the first 
case resulting in v — 6.16E — 10 A + 8.26.E — 8 and a constant fit in the second resulting 
in v = 4.83B - 7. 
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Figure 2: We plot here the statistical estimator v(A,m) with a fixed A = 1000 ± 50 
for different values of the business time lag m. I can be seen that v(A, m) grows with m 
(even if not linearly in all range considered). 



seems to be unlikely. The same kind of behavior can be found if one chooses 
the value of m in a range between 5 and 100. 

In fig. |2 we also plot the statistical estimator v(A,m) versus m with a 
fixed A = 1000±50. According to the calendar time hypothesis this quantity 
should be constant while, according to the business time hypothesis , should 
grow linearly in m. The behavior is not linear in all range but, anyway, 
v(A,m) grows with respect to m, which also supports the business time 
hypothesis. It should be noticed that the choice of other values of the fixed 
A would not alter this picture. 



Second, we consider two distinct series of returns TZ(A, m) and 7Z(A) 
(respectively a and b) as defined in the previous section. 

Since the minimum lag between two consecutive quotes is equal to 2 
seconds in the given database, the two series a and b coincide for A = 2 sec; 
formally: 1Z(A = 2sec,m = 1) = 1Z(A = 2sec). 

We have subsequently compared the estimated probability density func- 
tions (pdf) for the series a and b and the results are shown in fig. |31 

The figure is a log-linear plot of different probability densities, For A = 
2sec the pdf the two cases TZ(A = 2sec) and 1Z(A = 2sec, m = 1) exactly 
coincide because of the data set characteristics as just explained. For A = 




0.01 



Figure 3: Estimated probability density functions for 1Z(A = 2sec) = 1Z(A = 2sec, m = 
1), 1Z(A = lOOsec) and TZ(A = lOOsec, m = 1) in a log-linear plot. The first two pdf (+ 
symbols) coincide because of the data set characteristics as explained in the text; the pdf 
for 1Z(A — lOOsec, m — 1) (x symbols) is roughly the same of the first two while the 
pdf for TZ(A = lOOsec) (star symbols) is macroscopically different having larger moments. 
The significance of the plots lies in the fact that if m = 1, then a large calendar time 
seems to make no difference, whereas if m is allowed to vary, then the PDF becomes fat, 
due to return aggregation. 



lOOsec we observe a remarkable difference between the pdf for the series 
1Z(A = lOOsec, m = 1) and 7£(A = lOOsec). The former, in fact, is roughly 
the same as 7£(A = 2sec), while the second is fatter (larger moments). 

This fact disagrees with Eq. © which is a consequence of calendar time 
hypothesis. In fact, according to this equation the two pdf corresponding to 
7£(A = lOOsec, m = 1) and 7Z(A = lOOsec) should be equal. 

On the contrary, one can immediately see that this result is in accordance 
with Eq. and, therefore, with business time hypothesis. In fact, 7Z(A = 
lOOsec, m = 1) and 1Z{A = 2sec,m = 1) are roughly the same. This 
experimental equality simply means that given the value of m returns are 
substantially insensitive to A as stated in Eq. (JSJ). 

In conclusion, this experimental result provides further evidence that the 
correct model should be the one of the process evolving over business time 
(hypothesis H2). 

5 Conclusions 

In this paper we suggest that the business time approach is perhaps a better 
way of modeling price dynamics than calendar time. In order to derive some 
insight from data we neglect possible autocorrelation between returns and 
possible autocorrelation between lags assuming implicitly that they would 
only give a second order correction to our findings. With this simplification 
our results altogether seem to provide enough evidence for the rejection of 
hypothesis HI (calendar time model) and the acceptance of hypothesis H2 
(business time model). Nevertheless, it should be noticed that hypothesis 
HI assumes that the sampling process is independent of the price evolution. 
Therefore, our results do not rule out the continuous time model, but rather 
they show that the the continuous time model would require correlations 
between processes M and S to in order to fit the data. 

The deep reason of the behavior we point out in this paper is that when 
an asset (at least a for ex asset) is not traded, the prices evolution is slow 
while the evolution is fast when the asset is heavily traded. A faster evolution 
corresponds to a larger volatility in calendar time 011], therefore, one could 
even maintain the calendar point of view, but in this case it should accept 
a seasonal modulation of volatility. The fact that the evolution of a price is 
slow when there are few transactions is very well known to practitioners but 
it is still not accepted in its extremal consequence that prices are frozen when 



assets are not traded at all. This is because this behavior is in contrast to 
the stock market experience where opening prices are different from previous 
night closing prices. Nevertheless the difference between the two markets 
is not astonishing if one thinks that the stock market is artificially time 
regulated, while the forex exchange market is an over the counter (OTC) 
market not subject to any time restriction. 
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